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Wayback Machine Overview 



The Wayback Machine, a service from the Internet Archive and Alexa 
Internet, allows people to access and use archived versions of stored 
websites. Visitors to the Wayback Machine can type in an URL, select a 
date, and then begin surfing on an archived version of the web. The 
Wayback Machine is built so that it can be used and referenced by 
anybody and everybody. 

The original idea for the Wayback Machine began in 1 996, when the 
Internet Archive first began archiving the web. Now, five years later, 
with over 100 terabytes and a dozen web crawls completed, the Internet 
Archive has made the Wayback Machine available to the public. 

The Wayback Machine, which currently contains over 100 terabytes of 
data and is growing at a rate of 12 terabytes per month, is the largest 
known database in the world, containing multiple copies of the entire 
publicly available web. This eclipses the amount of data contained in 
the world's largest libraries, including the Library of Congress 

Read Internet Archive Wayback Machine user testimonials. 

News: The Smithsonian and the Internet Archive have agreed to build a 
new interactive exhibit featuring from the Internet Archive's September 
1 1, 2001 Web and Television archives. 
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"Archive site preserves 
earliest Web pages" 



FAQs 



Which sites are available in the Wayback Machine? 

The Internet Archive is attempting to archive the entire publicly 
available web. Some sites may not be included because the automated 
crawlers were unaware of their existence at the time of the crawl. It's 
also possible that some sites were not archived because they were 
password protected, detected webmaster instructions to not be crawled, 
or were otherwise inaccessible to the Internet Archive's automated 
systems. 

Why are some sites harder to archive than others? 

If you look at the collection of archived sites, you will find some broken 
pages, missing graphics, and some sites that aren't archived at all. The 
Internet Archive has tried to create a complete archive, but has had 
difficulties with some sites, because the link structure was not 
straightforward to crawl. 

Can I link to old pages on the Wayback Machine? 

Yes! The Wayback Machine is built so that it can be used and 
referenced by anybody and everybody. If you find an archived page that 
you would like to reference on your web page or in an article, you can 
copy the URL and share it with others. 

How was the Wayback Machine made? 

Over 100 terabytes of data is stored on a couple hundred modified 
servers situated in the basement of a former military building in the 
Presidio of San Francisco. Alexa Internet, in cooperation with the 
Internet Archive, has designed an index that allows browsing of web 
documents over multiple time periods, and turned this unique feature 
into the Wayback Machine. 

What type of machinery is used in the Wayback Machine? 

The Internet Archive is stored on dozens of slightly modified Hewlett 
Packard and uslab.com servers. The computers run on the FreeBSD and 
Linux operating systems. Each computer has about 512Mb of memory 
and generally holds just over 300 gigabytes of data on IDE disks. 

How can I get my site included in the Wayback Machine? 

Alexa Internet has been crawling the web since 1 996, which has 
resulted in a massive archive. If you have a web site, and you would like 
to ensure that it is saved for posterity in the Alexa Archive, chances are 
that it's already there. We make every effort to crawl the entire publicly 
available web. However, if you wish to take extra measures to ensure 
that we archive your site, you can visit the Alexa "Archive Your Site" 
page at http://pages.alexa.eom/help/webmasters/index.html#crawl_site . 

How can I remove my site from the Wayback Machine? 

To get your site removed, update your robots.txt file to disallow 
ia_archiver. Alexa's crawlers will get your new robots.txt, which will 
make its way into the Wayback Machine and mark all previously 
archived pages inaccessible. 
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